cHawk: An Efficient Biclustering Algorithm based on Bipartite Graph Crossing Minimization

نویسندگان

  • Waseem Ahmad
  • Ashfaq Khokhar
چکیده

Biclustering is a very useful data mining technique for gene expression analysis and profiling. It helps identify patterns where different genes are co-related based on a subset of conditions. Bipartite Spectral partitioning is a powerful technique to achieve biclustering but its computation complexity is prohibitive for applications dealing with large input data. We provide a connection between spectral partitioning and crossing minimization which is amenable to efficient implementations. Theoretical construction of Biclustering model based on crossing minimization is provided. Based on this model, an efficient biclustering algorithm, which is termed as cHawk, is developed. We have evaluated cHawk on both synthetic and real data sets. We show that cHawk is able to identify, with good accuracy, constant, coherent and overlapped biclusters amid noise. Moreover, its execution time grows linearly with input data size.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BiFree: An Efficient Biclustering Technique for Gene Expression Data Using Two Layer Free Weighted Bipartite Graph Crossing Minimization

Conventional clustering technique for gene expression data provides a global view of the data. In the biological prospective, a local view is essential for better analysis of gene expression data with simultaneous grouping of genes and conditions. Several biclustering techniques have been proposed in the literature based on different problem formulation. Therefore, it is difficult to compare th...

متن کامل

SPHier: Scalable Parallel Biclustering Using Weighted Bigraph Crossing Minimization

Biclustering is used for discovering correlations among subsets of attributes with subsets of transactions in a transaction database. It has an extensive set of applications ranging from Gene co-regulation analysis[4], documentkeyword clustering[12] and collaborative filtering for online recommendation systems[13]. In this paper, we propose optimal biclustering problem as maximal crossing numbe...

متن کامل

BiCross : A Biclustering Technique for Gene Expression Data using One Layer Fixed Weighted Bipartite Graph Crossing Minimization

Biclustering has become an important data mining technique for microarray gene expression analysis and profiling, as it provides a local view of the hidden relationships in data, unlike a global view provided by conventional clustering techniques. This technique, in contrast to the conventional clustering techniques, helps in identifying a subset of the genes and a subset of the experimental co...

متن کامل

Privacy Preserving Collaborative Filtering using Biclustering in Ubiquitous Computing Environments

Privacy concerns are a major hurdle in the success of personalized services in ubiquitous computing environments. Personalized recommendations are usually served using Collaborative Filtering techniques. In this paper, we propose a framework for privacy preserving collaborative filtering in ubiquitous computing environments. The proposed framework is based on a biclustering algorithm which empl...

متن کامل

META-HEURISTIC ALGORITHMS FOR MINIMIZING THE NUMBER OF CROSSING OF COMPLETE GRAPHS AND COMPLETE BIPARTITE GRAPHS

The minimum crossing number problem is among the oldest and most fundamental problems arising in the area of automatic graph drawing. In this paper, eight population-based meta-heuristic algorithms are utilized to tackle the minimum crossing number problem for two special types of graphs, namely complete graphs and complete bipartite graphs. A 2-page book drawing representation is employed for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007